CDS

Accession Number TCMCG075C22201
gbkey CDS
Protein Id XP_007021896.2
Location join(7922541..7922949,7923319..7923911)
Gene LOC18594320
GeneID 18594320
Organism Theobroma cacao
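
The Location field can be cross-checked against the protein length with a little arithmetic. The sketch below is a minimal illustration, not part of the record: the helper names `parse_join` and `spliced_length` are hypothetical, and it assumes a forward-strand `join(...)` string with 1-based inclusive coordinates and no `complement(...)` wrapper or partial markers.

```python
import re

def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs from a join(...) string.

    Assumes 1-based inclusive coordinates on the forward strand.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(segments: list[tuple[int, int]]) -> int:
    """Total number of nucleotides across all segments."""
    return sum(end - start + 1 for start, end in segments)

segments = parse_join("join(7922541..7922949,7923319..7923911)")
cds_nt = spliced_length(segments)   # 409 + 593 nucleotides
codons = cds_nt // 3                # includes the terminal stop codon
print(cds_nt, codons - 1)           # prints: 1002 333
```

The two segments sum to 1002 nt, i.e. 334 codons, which is consistent with a 333-residue protein plus one stop codon.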

Protein

Length 333 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007021834.2
Definition PREDICTED: zingipain-1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category O
Description cysteine-type peptidase activity
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko00536, ko00537, ko01000, ko01002, ko03110, ko04147
KEGG_ko ko:K01363, ko:K01365, ko:K01366, ko:K16290, ko:K16292
EC 3.4.22.1, 3.4.22.15, 3.4.22.16
KEGG_Pathway ko04140, ko04142, ko04145, ko04210, ko04612, ko04621, ko04924, ko05205, ko05323, ko05418, map04140, map04142, map04145, map04210, map04612, map04621, map04924, map05205, map05323, map05418
GOs -

Sequence

CDS:  
ATGACTTTAAACTTCGTCTTGTTTTTCGTTTTTGGGATTTTGGCATCTCAAGCCATGTCCCGCACAATGAATGAAGCTGCCATTGCTGAACACGAGCTATGGATGGCTAAATATGGTCGAACTTATCAGACCAAAACTGAGAAAGACAGACGTTTCAAGATATTCAAGGAGAACTTGGAATACATTCAGAACTTCAACAATGCTGGAAATAGGAGTTATAAGTTAGGCATTAATGAGTTTGCAGATATGAGCCATGATGAATTCGTTGCGGCTCGTACTGGATACAAGAATCCAGGTAACCTAGCAACATCATCACCATTTAGCTATGCAGAGTTTACAGATGTTCCAACAAGCTTGGATTGGAGGGAAAACGGCGCTGTCACCGCTGTTAAGGACCAGGGAGATTGTGGATGTTGTTGGGCATTTGCCGCTGTTGCAGCCGTGGAAGGCATTAACCAAATTAAAACTGGAAAGCTGATCTCATTGTCCGAGCAACAAGTATTAGACTGCAGCACAAATGGCAACAACCATGGTTGTGGGGGTGGTTCCAAGACAGATGCCTTCCAATACATTATGCAAAACGGTGGATTAACCACAGAGGACAATTATCCATATCAAGCAACGCAAGGAGCTTGTGACAAAGAGAAGGAGACATCGCACGTTGCTGATATCAGTGATTATGCGAGGGTACCCGCCAACAGCGAGGAGGAATTACTTAAAGCTGTATCAAACCAACCTGTCACAATTAGCATTGAGGCTAGTGGAATGGACTTTAAATTTTACGAAAGTGGAATCTTCAGTGGAGATTGCGGAACTAATCTAAACCATGCTGTCACTGTTGTTGGATTTGGGACCAGCGTAGACGGAATAGATTACTGGTTGGTCAAGAATTCATGGAACCAAAGTTGGGGTGAGAATGGCTACATAAAGATGCAGAGGAATGTGGATGCCTCGGAAGGCCTCTGTGGCCTTGCCATAAGACCAGCCTATCCAATTGCATAA
Protein:  
MTLNFVLFFVFGILASQAMSRTMNEAAIAEHELWMAKYGRTYQTKTEKDRRFKIFKENLEYIQNFNNAGNRSYKLGINEFADMSHDEFVAARTGYKNPGNLATSSPFSYAEFTDVPTSLDWRENGAVTAVKDQGDCGCCWAFAAVAAVEGINQIKTGKLISLSEQQVLDCSTNGNNHGCGGGSKTDAFQYIMQNGGLTTEDNYPYQATQGACDKEKETSHVADISDYARVPANSEEELLKAVSNQPVTISIEASGMDFKFYESGIFSGDCGTNLNHAVTVVGFGTSVDGIDYWLVKNSWNQSWGENGYIKMQRNVDASEGLCGLAIRPAYPIA
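
The CDS and protein sequences above are related by translation with the standard genetic code. The following is a minimal sketch of that check, written in plain Python without external libraries. The function name `translate` and the table layout are illustrative assumptions. The example inputs are the first 21 and last 18 nucleotides of the CDS shown above, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Assumes only the unambiguous bases A, C, G, T; any other
    character raises KeyError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGACTTTAAACTTCGTCTTG"))  # prints: MTLNFVL
print(translate("GCCTATCCAATTGCATAA"))     # prints: AYPIA
```

Both fragments match the start (`MTLNFVL`) and end (`AYPIA`) of the protein sequence listed above. Passing the full CDS string to `translate` would allow the same comparison over the whole sequence.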